Whose Nickname is This? Recognizing Politicians from Their Aliases
Authors
Abstract
Using aliases to refer to public figures is one way to make fun of people, to express sarcasm, or even to sidestep legal issues when expressing opinions on social media. However, linking an alias back to the real name is difficult, as it entails phonemic, graphemic, and semantic challenges. In this paper, we propose a phoneme-based approach that injects semantic information to align aliases with politicians' formal Chinese names. The proposed approach builds a hidden Markov model (HMM) for each name to model its phonemes, and takes into account document-level pairwise mutual information to capture the semantic relation between the name and the alias. We also introduce two new datasets consisting of 167 phonemic pairs and 279 mixed pairs of aliases and formal names. Experimental results show that the proposed approach models both phonemic and semantic information and outperforms previous work on both datasets, with best top-1 accuracies of 0.78 on the phonemic dataset and 0.59 on the mixed dataset.
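To make the semantic signal mentioned in the abstract concrete, the following is a minimal sketch of a mutual-information score computed from document-level co-occurrence. It is an illustration only, not the authors' implementation: the function name `document_level_pmi`, the substring-based matching, and the toy corpus are all assumptions, and the abstract does not specify how the score is estimated or combined with the HMM.

```python
import math


def document_level_pmi(documents, alias, name):
    """Score how strongly an alias and a name co-occur across documents.

    Hypothetical helper: counts each document at most once per term and
    uses plain substring matching (an assumption made here because
    Chinese text has no whitespace word boundaries).
    Returns None when the two terms never co-occur.
    """
    total = len(documents)
    if total == 0:
        return None

    has_alias = sum(1 for doc in documents if alias in doc)
    has_name = sum(1 for doc in documents if name in doc)
    has_both = sum(1 for doc in documents if alias in doc and name in doc)

    if has_both == 0:
        return None  # log of zero is undefined; treat as "no evidence"

    # log2( P(alias, name) / (P(alias) * P(name)) ) with document frequencies
    return math.log2((has_both * total) / (has_alias * has_name))


# Toy corpus with placeholder tokens standing in for a name and its aliases.
corpus = ["A met B", "A spoke", "B and A", "C"]
score = document_level_pmi(corpus, alias="B", name="A")
```

A higher score suggests that the alias and the formal name tend to appear in the same documents more often than chance would predict, which is the kind of evidence a semantic component could add on top of a phonemic match.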
Similar resources
Aliases and Ambiguity: A case study of gene aliases, and implications for information curation and AI
This research seeks to understand how names and aliases of concepts are used in scientific literature. Natural language processing tools, and data curation in general, depend upon unique concept identifiers for information, and aliases only provide more opportunity for ambiguity; despite this, aliases seem to persist in literature and daily life. As a case study, gene names are analyzed. This a...
Effective Branch Prediction through Caching of Aliasing Branches
High performance CPUs constantly face obstacles in pipelining delays from conditional branches to reach their expected potential. Precise branch prediction is required to overcome this performance limitation imposed on high performance architecture and is the key to many techniques for enhancing and exploiting Instruction-Level Parallelism (ILP). In general, prediction accuracy can be improved ...
Automatically Extracting Personal Name Aliases from the Web
An entity can be referred to by multiple name aliases on the web. Extracting aliases of an entity is important for various tasks such as identification of relations among entities, automatic metadata extraction and entity disambiguation. To extract relations among entities properly, one must first identify those entities. Aliases of an entity are useful as metadata for that entity and can be used ...
Nicknames and the Lexicon of Sports
This article examines the structure and usage of nicknames given to professional hockey and baseball players. Two general types are observed: a phrasal referring expression and a single-word hypocoristic. The phrasal nickname is descriptive but is only used referentially, usually in sports narrative. The hypocoristic is used for both reference and address and may be descriptive or shortened fro...
Aligning Entity Names with Online Aliases on Twitter
This paper presents new models that automatically align online aliases with their real entity names. Many research applications rely on identifying entity names in text, but people often refer to entities with unexpected nicknames and aliases. For example, The King and King James are aliases for Lebron James, a professional basketball player. Recent work on entity linking attempts to resolve me...